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SEQUENCE LISTING 

<110> TAKEDA, Junji 
HORIE, Kyoji 
YUSA, Kosuke 
ISHIHARA, Hiroshi 

i 

<12 0> DEVELOPMENT OF MAMMALIAN GENOME . MODIFICATION TECHNIQUE 
USING RETROTRANSPOSON 

<13 0> 71004 8 . 401USPC 

<140> US 10/579/936 
<141> 2004-11-1? 

► 

<150> PCT/JP2004/0173 07 
<151> 2004-11-19 

> 

<150> JP 2003-393175 
<151> 2003-11-21 

<160> 47 

<170> Patentln Aversion 3.2 

<210> 1 

<211> 7318 

<212> DNA 

<213> Mus musculus 

<220> 

<223> IAP sequence actually used in the Examples for exemplification 
<220> 

<221> CDS 

<222> (670) . . (2427) 

<220> 

<221> CDS 

<222> (2430) . . (3203) 
<220> 

<221> CDS 

<222> (3440) . . (5854) 
<400> 1 

tgtgggaagc cgcccccaca ttcgccgtca caagatggtg ctgacatcct gtgttctaag 60 
ttggtaaaca aataatctgc gcatgagcca agggtattta cgactacttg. tactctgttt 12 0 
ttcccgtgaa cgtcagctcg gccatgggct gcagccaatc agggagtgat gcgccctagg 180 
caatggttgt tctctt:taaa atagaagggg tttcgttttt ctcgctctct tgcttcttgc 240 
tctctcttgc ttccctctct tgcttcttgc tctctcttgc ttccctctct tgcttcttgc 300 
tctctcttgc ttccctctct tgcttcttgc tctctcttgc ttcttgctct cttttcctga 360 
agatgtaaga ataaagcttt gtcgcagaag attctggtct gtggtgttct tcctggccgg 420 
tcgtgagaac gcgtcgaata acaattggtg ccgaattccg ggacgagaaa atccgggacg 480 



I 

i 



2 



agaaaaaact ccggactggc gcaggaggga tacttcattt cagaaccaga actacggatc 
acgtttataa aggttcccgt aacacagact gttgagaagg attcaactgc cgaattcaga 
actcatcagc tggggaacga cggtgataaa ggttcccgta aagcagactg ttaaaaagga 
ttcaactgt atg aat tea gaa ctt ttc age tgg gga acg aga gta cca gtg 

Met Asn Ser Glu Leu Phe Ser Trp Gly Thr Arg Val Pro Val 

1 5 10 



540 
600 
660 
711 



agt atg ttt ggc ctt gaa ttt ttt ctg gtg tta gga gec ctt ttg ttc 
Ser Met Phe Gly Leu Glu Phe Phe Leu Val Leu Gly Ala Leu Leu Phe 
15 20 25 30 



759 



ctt ttc aca tgttat ata gtg gtt aag gca ggg ctg aaa att ctg gat 
Leu Phe Thr Cys Tyr lie Val Val Lys Ala Gly Leu Lys lie Leu Asp 

35 40 45 



807 



gaa att cag ggc lagt eta tea gaa gta aag egg gga gag aga gta gga 
Glu lie Gin Gly Ser Leu Ser Glu Val Lys Arg Gly Glu Arg Val Gly 

50 55 60 



855 



gca agg aga aac 
Ala Arg Arg Asn 
65 



ggt aag tat aca ggc ctt tec aag ggt ctt gaa ccc 
Gly Lys Tyr Thr Gly Leu Ser Lys Gly Leu Glu Pro 

70 75 



903 



gag gaa aag tta agg tta ggt agg aat acc tgg aga gag att aga aga 
Glu Glu Lys Leu Arg Leu Gly Arg Asn Thr Trp Arg Glu lie Arg Arg 
80 85 90 



951 



aaa aga gga aaa agg gaa aag aaa aaa gat cga tta gcg gag gtc tct 
Lys Arg Gly Lys Arg Glu Lys Lys Lys Asp Arg Leu Ala Glu Val Ser 
95 100 105 110 



999 



a< 33 a 9 a tac tc 9 | tea eta gat gag etc agg aag cca get ctt agt age 
Arg Arg Tyr Ser Ser Leu Asp Glu Leu Arg Lys Pro Ala Leu Ser Ser 

115 120 125 



1047 



tct gaa gca agt gaa gaa tec tec tct gag gaa aca gac tgg gag gaa 
Ser Glu Ala Ser ;Glu Glu Ser Ser Ser Glu Glu Thr Asp Trp Glu Glu 

130 '. 135 140 



1095 



gaa gca gec cat ,tac cag cca get aat tgg tea aga aaa aag cca aaa 
Glu Ala Ala His ; Tyr Gin Pro Ala Asn Trp Ser Arg Lys Lys Pro Lys 
145 150 155 



1143 



gcg get ggc gaa agt cag cgt act gtt caa cct ccc ggc agt egg ttt 
Ala Ala Gly Glu Ser Gin Arg Thr Val Gin Pro Pro Gly Ser Arg Phe 
160 165 170 



1191 



caa ggt ccg ccc tat gcg gag ccc ccg ccc tgc gta gtg cgt cag caa 1239 
Gin Gly Pro Pro Tyr Ala Glu Pro Pro Pro Cys Val Val Arg Gin Gin 
175 180 185 190 

tgc gca gag agg caa tgc gca gag agg tgc gca gag agg cag tgc gca 1287 
Cys Ala Glu Arg Gin Cys Ala Glu Arg Cys Ala Glu Arg Gin Cys Ala 

195 200 205 



i 



3 



gac agg tgc gca gag agg cag tgc gca gag agg cag tgc gca gac tea 
Asp Arg Cys Ala Glu Arg Gin Cys Ala Glu Arg Gin Cys Ala Asp Ser 

210 215 220 



1335 



ttc att ccc cga gag gaa caa aag aaa ata gaa cag gca ttt cca gtc 
Phe lie Pro Arg Glu Glu Gin Lys Lys lie Glu Gin Ala Phe Pro Val 
225 230 235 



1383 



ttt gaa gga gec gag ggt ggg cgt gtc cac get ccg gta gaa tac gta 
Phe Glu Gly Ala Glu Gly Gly Arg Val His Ala Pro Val Glu Tyr Val 
240 245 250 



1431 



cag att aag gaa att gec gag teg gtt cgt aaa tac gga acc aat get 
Gin lie Lys Glu lie Ala Glu Ser Val Arg Lys Tyr Gly Thr Asn Ala 
255 260 265 270 



1479 



aat ttc acc ttg gtg cag tta gac agg etc get ggt atg gca eta acg 
Asn Phe Thr Leu ; Val Gin Leu Asp Arg Leu Ala Gly Met Ala Leu Thr 

275 280 285 



1527 



cct get gat tgg cag acg gtt gta aaa gec get ctt cct agt atg ggc 
Pro Ala Asp Trp Gin Thr Val Val Lys Ala Ala Leu Pro Ser Met Gly 

290 295 300 



1575 



aaa tat atg gaa 
Lys Tyr Met Glu 
305 



tgg aaa gcg ctt tgg cac gaa get gca cag gcg cag 
Trp Lys Ala Leu Trp His Glu Ala Ala Gin Ala Gin 

310 315 



1623 



gee cga gca aac gca get get ttg act cca gag cag aga gat tgg act 
Ala Arg Ala Asn 'Ala Ala Ala Leu Thr Pro Glu Gin Arg Asp Trp Thr 
320 325 330 



1671 



ttt gac ttg tta acg ggt cag gga get tat tct get gat cag aca aac 
Phe Asp Leu Leu Thr Gly Gin Gly Ala Tyr Ser Ala Asp Gin Thr Asn 
335 340 345 350 



1719 



tac cat tgg gga igct tat gec cag att tct tec acg get att agg gec 
Tyr His Trp Gly Ala Tyr Ala Gin lie Ser Ser Thr Ala lie Arg Ala 

i355 360 365 



1767 



tgg aag gcg etc tct cga gca ggt gaa acc act ggt cag tta aca aaa 
Trp Lys Ala Leu iSer Arg Ala Gly Glu Thr Thr Gly Gin Leu Thr Lys 

370 375 380 



1815 



ata ate cag gga cct cag gaa tct ttc tea gat ttt gtg gec aga atg 
lie lie Gin Gly Pro Gin Glu Ser Phe Ser Asp Phe Val Ala Arg Met 
385 390 395 



1863 



aca gag gca gca gag cgt att ttt gga gag tea gag caa get gcg cct 
Thr Glu Ala Ala Glu Arg lie Phe Gly Glu Ser Glu Gin Ala Ala Pro 
400 405 410 



1911 



etc ata gaa cag eta ate tac gag caa gee aca aag gag tgc cga gcg 



1959 



Leu lie Glu Gin Leu lie Tyr Glu Gin Ala Thr Lys Glu Cys Arg Ala 
415 420 425 430 

i 
i 

gcc ata gcc cca ;aga aag aac aaa ggc tta caa gac tgg etc agg gtc 
Ala lie Ala Pro Arg Lys Asn Lys Gly Leu Gin Asp Trp Leu Arg Val 

435 440 445 

tgt cga gag ctt Iggg gga ccc ctt age aat gca ggt tta gcg get gcc 
Cys Arg Glu Leu Gly Gly Pro Leu Ser Asn Ala Gly Leu Ala Ala Ala 

450 455 460 



2007 



2055 



ate ctt caa tec caa aac cgc tec atg ggc aga aat gat cag agg aca 
lie Leu Gin Ser Gin Asn Arg Ser Met Gly Arg Asn Asp Gin Arg Thr 

465 , 470 475 

i 
I 

tgt ttt aac tgc ! gga aag cct ggg cat ttt aag aaa gat tgc aga get 
Cys Phe Asn Cys [Gly Lys Pro Gly His Phe Lys Lys Asp Cys Arg Ala 
480 485 490 

cca gat aaa cag Igga ggg act etc act ctt tgc tct aag tgt ggc aag 
Pro Asp Lys Gin ;Gly Gly Thr Leu Thr Leu Cys Ser Lys Cys Gly Lys 
495 500 505 510 



2103 



2151 



2199 



ggt tat cat aga jget gac cag tgt cgc tct gtg agg gat ata aag ggc 
Gly Tyr His Arg |Ala Asp Gin Cys Arg Ser Val Arg Asp lie Lys Gly 

1515 520 525 

j 

i 

aga att ctt ccc cca cct gat agt caa tea get gat gtg cca aaa aac 

Arg lie Leu Pro 'Pro Pro Asp Ser Gin Ser Ala Asp Val Pro Lys Asn 

530 535 540 



2247 



2295 



ggg tea ccg 
Gly Ser Pro 
545 

ttt gtc agg 
Phe Val Arg 
560 



ggc jeet egg 
Gly I Pro Arg 



acc 
Thr 



cag gaa 
Gin Glu 



tec cag ggc cct caa aga tat ggg aac egg 
Ser Gin Gly Pro Gin Arg Tyr Gly Asn Arg 
550 555 

gca gtc aga gag acg acc cag gaa gac cca 
Ala Val Arg Glu Thr Thr Gin Glu Asp Pro 
565 570 



2343 



2391 



caa ggg tgg acc 
Gin Gly Trp Thr 
575 



tgc gtg 
Cys Val 
580 



ccg cct ccg act tec tat ta atg cct caa 
Pro Pro Pro Thr Ser Tyr Met Pro Gin 

585 



atg agt att cag |ccg gtg ccg gtg gag cct ata cca tec ttg ccc ccg 
Met Ser lie Gin! Pro Val Pro Val Glu Pro lie Pro Ser Leu Pro Pro 
590 595 600 605 



gga acc atg ggc 
Gly Thr Met Gly 



ctt att etc ggc cga ggt tea etc acc ttg cag ggc 
Leu lie Leu Gly Arg Gly Ser Leu Thr Leu Gin Gly 
610 615 620 



2438 



2486 



2534 



tta gta gtc cac cct gga att atg gat tgt caa cat tec cct gaa ata 
Leu Val Val His Pro Gly lie Met Asp Cys Gin His Ser Pro Glu lie 

625 I 630 635 



2582 



i 



5 



cag gtc ctg 
Gin Val Leu 
640 

gat agg ata 
Asp Arg lie 
655 

tct gca gga 
Ser Ala Gly 
670 

gcc tat ttg 
Ala Tyr Leu 



att aat gga 
lie Asn Gly 



agt ata att 
Ser lie lie 
720 

tea tct cat 
Ser Ser His 
735 

age tec gtt 
Ser Ser Val 
750 

ttc ata cct 
Phe lie Pro 



att atg cag 
lie Met Gin 



gga ggg tat 
Gly Gly Tyr 
800 

aaa gaa gga 
Lys Glu Gly 
815 

ate tea cct 
lie Ser Pro 
830 



tgc tea age 
Cys Ser Ser 

get cag ctg 
Ala Gin Leu 



cct gag ata 
Pro Glu lie 
675 

gtt gta tct 
Val Val Ser 
690 

aaa gag ttt 
Lys Glu Phe 
705 

tct aca cat 
Ser ;Thr His 



tea tta cag 
Ser Leu Gin 



gcc ttg acg 
Ala Leu Thr 
755 

tat ,gtg etc 
Tyr Val Leu 
770 

cat ttg ggc 
His Leu Gly 
785 

tea get aaa 
Ser Ala Lys 



aaa ggg tta 
Lys Gly Leu 

i 

aat gga aac 
Asn Gly Asn 
835 



cct aaa ggc 
Pro Lys Gly 
645 

ctg etc etc 
Leu Leu Leu 
660 

aag aaa atg 
Lys Lys Met 



tta aat gat 
Leu Asn Asp 



gaa ggc ate 
Glu Gly lie 
710 

tgg tgg ccc 

Trp Trp Pro 
725 

ggc eta gga 
Gly Leu Gly 
740 

tgg gaa tec 
Trp Glu Ser 



cca etc ccg 
Pro Leu Pro 



ctt att ttg 
Leu lie Leu 
790 

gca aaa aat 
Ala Lys Asn 
805 

gga cat caa 
Gly His Gin 
820 

caa gac aga 
Gin Asp Arg 



gtt ttt tct 
Val Phe Ser 



cct gat aat 
Pro Asp Asn 
665 

ggc tec tea 
Gly Ser Ser 
680 

aga cct aag 
Arg Pro Lys 
695 

ctt gat ace 
Leu Asp Thr 



aaa gca tgg 
Lys Ala Trp 



^tat caa tea 
Tyr Gin Ser 
745 

tct gaa ggg 
Ser Glu Gly 
760 

gtt aac etc 
Val Asn Leu 

775 

tec aat gaa 
Ser Asn Glu 



ate atg gca 
lie Met Ala 



gaa cag gga 
Glu Gin Gly 
825 

cag ggt ctg 
Gin Gly Leu 
840 



att agt aaa 
lie Ser Lys 
650 

ace agg gag 
Thr Arg Glu 



gga aat gat 
Gly Asn Asp 



etc cgc ctt 
Leu Arg Leu 
700 

gga gca gat 
Gly Ala Asp 
715 

ccc ace aca 
Pro Thr Thr 
730 

tgt ccc act 
Cys Pro Thr 



cag caa ggg 
Gin Gin Gly 



tgg gga agg 
Trp Gly Arg 
780 

aac gcc cca 
Asn Ala Pro 
795 

aag atg ggt 
Lys Mef Gly 
810 

agg ata gag 
Arg lie Glu 



ggt ttt cca 
Gly Phe Pro 



gga 2630 
Gly 

aaa 2 67 8 
Lys 

tct 2726 

Ser 

685 

aag 2 774 
Lys 



aaa 2822 
Lys 



gag 2870 
Glu 



ata 2918 
He 



aaa 2966 

Lys 

765 

gat 3 014 

Asp 



tea 3062 
Ser 



tat 3110 
Tyr 



ccc 3158 
Pro 



3203 



tageggecat tggggcagca cgacccatac catggaaaac 



aggggaccca gtgtgggttc 3263 



6 



ctcaatggca cctatcctct gaaaaactag aagctgtgat tcaactggta gaggaacaat 3323 
taaaattagg ccatattgaa ccctctacct caccttggaa tactccaatt tttgtaatta 3383 
agaaaaagtc aggaaagtgg agactgctcc atgacctcag agccattaat gagcaa atg 3442 

Met 
845 

aac tta ttt ggc cca gta cag agg ggt etc cct gta ett tec gee tta 3490 
Asn Leu Phe Gly Pro Val Gin Arg Gly Leu Pro Val Leu Ser Ala Leu 

850 855 860 

cca cgt ggc tgg aat tta ate att ata gat att aaa gat tgt ttc ttt 3538 
Pro Arg Gly Trp Asn Leu lie lie lie Asp lie Lys Asp Cys Phe Phe 

865 870 875 

tct ata cct ttg tgt cca agg gat agg ccc aga ttt gec ttt acc ate 3586 
Ser lie Pro Leu Cys Pro Arg Asp Arg Pro Arg Phe Ala Phe Thr lie 
880 885 890 

ccc tct att aat cac atg gaa cct gat aag agg tat caa tgg aag gtc 3634 
Pro Ser lie Asn His Met Glu Pro Asp Lys Arg Tyr Gin Trp Lys Val 

895 900 905 

i 

tta cca cag gga atg tec aat agt cct act atg tgt caa ctt tat gta 3682 
Leu Pro Gin Gly ;Met Ser Asn Ser Pro Thr Met Cys Gin Leu Tyr Val 
910 915 920 925 

caa gaa get ctt ttg cca gtg agg gaa caa ttc ccc tct tta att ttg 3730 
Gin Glu Ala Leu Leu Pro Val Arg Glu Gin Phe Pro Ser Leu lie Leu 

930 935 : 940 

etc ctt tac atg gat gac ate etc ctg tgc cat aaa gac ctt acc atg 3778 
Leu Leu Tyr Met Asp Asp lie Leu. Leu Cys His Lys Asp Leu Thr Met 

945 950 955 

eta caa aag gca tat cct ttt eta ctt aaa act tta agt cag tgg ggt 3826 
Leu Gin Lys Ala Tyr Pro Phe Leu Leu Lys Thr Leu Ser Gin Trp Gly 
960 965 970 

eta cag ata gee aca gaa aag gtc caa att tct gat aca gga caa ttc 3874 
Leu Gin lie Ala Thr Glu Lys Val Gin lie Ser Asp Thr Gly Gin Phe 
975 980 985 

ttg 99 c tct gtg gtg tec cca gat aag att gtg ccc caa aag gta gag 3922 
Leu Gly Ser Val Val Ser Pro Asp Lys lie Val Pro Gin Lys Val Glu 
990 995 1000 1005 

ata aga aga gat cac etc cat acc tta aat gat ttt caa aag ctg 3967 
lie Arg Arg Asp 'His Leu His Thr Leu Asn Asp Phe Gin Jjys Leu 

1010 1015 1020 

ttg gga gat att aat tgg etc aga ccc ttt tta aag att cct tct 4012 
Leu Gly Asp lie Asn Trp Leu Arg Pro Phe Leu Lys lie Pro Ser 

1025 1030 1035 



7 



get gaa tta agg cct ttg ttt agt att tta gaa gga gat cct cat 4057 
Ala Glu Leu Arg Pro Leu Phe Ser lie Leu Glu Gly Asp Pro His 

1040 1045 1050 

ate tec tec cct agg act ctt act eta get get aac cag gee tta 4102 
lie Ser Ser Pro Arg Thr Leu Thr Leu Ala Ala Asn Gin Ala Leu 

1055 1060 1065 

caa aaa gtg gaa aaa gee tta cag aat gca caa tta caa cgt att 414 7 

Gin Lys Val Glu Lys Ala Leu Gin Asn Ala Gin Leu Gin Arg lie 

1070 1075 1080 

gag gat teg cag cct ttc agt ttg tgt gtc ttt aag aca gca caa 4192 
Glu Asp Ser Gin Pro Phe Ser Leu Cys Val Phe Lys Thr Ala Gin 

1085 1090 1095 

ttg cca act gca gtt ttg tgg cag aat ggg cca ttg ttg tgg ate 4237 
Leu Pro Thr Ala Val Leu Trp Gin Asn Gly Pro Leu Leu Trp lie 

1100 1105 1110 

cat cca aac gta tec cca get aaa ata ata gat tgg tat cct gat 4282 
His Pro Asn Val Ser Pro Ala Lys lie lie Asp Trp Tyr Pro Asp 

1115 1120 1125 

gca att gca cag ctt gee ctt aaa ggc eta aaa gca gca ate acc 4327 
Ala lie Ala Gin Leu Ala Leu Lys Gly Leu Lys Ala Ala lie Thr 

1130 1135 1140 

cac ttt ggg caa agt cca tat ctt tta att gta cct tat acc get 4372 
His Phe Gly Gin Ser Pro Tyr Leu Leu lie Val Pro Tyr Thr Ala 

1145, 1150 1155 

gca cag gtt caa acc ttg gca gee gca tct aat gat tgg gca gtt 4417 
Ala Gin Val Gin Thr Leu Ala Ala Ala Ser Asn Asp Trp Ala Val 

1160 1165 ' 1170 

tta gtt acc tec ttt tea gga aaa ata gat aac cat tat cca aag 4462 
Leu Val Thr Ser Phe Ser Gly Lys lie Asp Asn His Tyr Pro Lys 

1175 1180 1185 

cat cca ate tta cag ttt gee caa aat caa tct gtt gtg ttt cca 4507 
His Pro lie Leu. Gin Phe Ala Gin Asn Gin Ser Val Val Phe Pro 

1190 1195 1200 

caa ata aca gta aga aac cca ctt aaa aat ggg att gtg gta tat 4552 
Gin lie Thr Val Arg Asn Pro Leu Lys Asn Gly lie Val Val Tyr 

1205 1210 1215 

act gat gga tea aaa act ggc ata ggt gee tat gtg get aat ggt 4597 
Thr Asp Gly Ser Lys Thr Gly lie Gly Ala Tyr Val Ala Asn Gly 

1220 1225 1230 

aaa gtg gta tec aaa cag tat aat gaa aat tea cct caa gtg gta 4642 
Lys Val Val Ser Lys Gin Tyr Asn Glu Asn Ser Pro Gin Val Val 



i 



8 



1235 



1240 



1245 



gaa tgt tta gtg gtc tta gaa gtt tta aaa acc ttt tta aaa ccc 
Glu Cys Leu Val Val Leu Glu Val Leu Lys Thr Phe Leu Lys Pro 

1250 1255 1260 



4687 



ctt aat att gtg tea gat tec tgt tat gtg gta aat gca gta aat 
Leu Asn lie Val Ser Asp Ser Cys Tyr Val Val Asn Ala Val Asn 

1265 1270 1275 



4732 



ctt tta gaa gtg get gga gtg att aag cct tec agt aga gtt gee 
Leu Leu Glu Val Ala Gly Val lie Lys Pro Ser Ser Arg Val Ala 

1280 1285 1290 



4777 



aat att ttt cagicag ata caa tta gtt ttg tta tct aga aga tct 
Asn lie Phe Gin Gin lie Gin Leu Val Leu Leu Ser Arg Arg Ser 

1295 1300 1305 



4822 



cct gtt tat att act cat gtt aga gec cat tea ggc eta cct ggc 
Pro Val Tyr lie Thr His Val Arg Ala His Ser Gly Leu Pro Gly 

1310 1315 1320 



4867 



ccc atg get ctg;gga aat gat ttg gca gat aag gee act aaa gtg 
Pro Met Ala Leu Gly Asn Asp Leu Ala Asp Lys Ala Thr Lys Val 

1325 1330 1335 



4912 



gtg get get gee eta tea tec ccg gta gag get gca aga aat ttt 
Val Ala Ala Ala Leu Ser Ser Pro Val Glu Ala Ala Arg Asn Phe 

1340 1345 1350 



4957 



cat aat aat ttt cat gtg acg get gaa aca tta cgc agt cgt ttc 
His Asn Asn Phe, His Val Thr Ala Glu Thr Leu Arg Ser Arg Phe 

1355 1360 1365 



5002 



tec ttg aca aga; aaa gaa gee cgt gac att gtt act caa tgt caa 
Ser Leu Thr Arg Lys Glu Ala Arg Asp lie Val Thr Gin Cys Gin 

1370 1375 1380 

age tgc tgt gag! ttc ttg cca gtt cct cat gtg gga att aac cca 
Ser Cys Cys Glu Phe Leu Pro Val Pro His Val Gly lie Asn Pro 

1385 1390 1395 



5047 



5092 



cgc ggt att cgacct eta cag gtc tgg caa atg gat gtt aca cat 
Arg Gly lie Arg j Pro Leu Gin Val Trp Gin Met Asp Val Thr His 

1400 1405 1410 



5137 



gtt tct tec ttt gga aaa ctt caa tat etc cat gtg tec att gac 



Val Ser Ser Phe 



aca tgt tct ggc 
Thr Cys Ser Gly 



Gly Lys Leu Gin Tyr Leu His Val Ser lie Asp 



1415 



1420 



1425 



ate atg ttt get tct ccg tta act gga gaa aaa 
lie Met Phe Ala Ser Pro Leu Thr Gly Glu Lys 
1430 1435 1440 



5182 



5227 



• 



gcc tea cat 
Ala Ser His 



ggg aaa ccc 
Gly Lys Pro 

tec caa aaa 
Ser Gin Lys 



ctg act gga 
Leu Thr Gly 



cgt gcg cat 
Arg Ala His 



gga act ttt 
Gly Thr Phe 



tct ttg gca 
Ser Leu Ala 



ggc cat act 
Gly His Thr 



aat gag atg 
Asn Glu Met 



ggc ccg gat 
Gly Pro Asp 



ttc cca cag 
Phe Pro Gin 



acc cga aaa 
Thr Arg Lys 



ctt ggt gat 
Leu Gly Asp 



ggg gat aat 
Gly Asp Asn 



gtg att caa 
Val ; He Gin 

; 1445 

aga : etc ctt 
Arg Leu Leu 
1460 

ttt cag cag 
Phe 5 Gin Gin 
' 1475 

ctt cca tac 
Leu Pro Tyr 
1490 

cgc acc etc 
Arg Thr Leu 
1505 

gag gag act 
Glu Glu Thr 
; 1520 

etc ttt aca 
Leu , Phe Thr 
1535 

gcg get gaa 
Ala Ala Glu 
1550 

gtt aaa tgg 
Val Lys Trp 
1565 

cct ate ttg 
Pro He Leu 
1580 

aat gaa gac 
Asn Glu Asp 
1595 

ate cag act 
He Gin Thr 
1610 

gtc cag ggc 
Val Gin Gly 
1625 

gtc gac att 
Val Asp He 



cat tgt ctt 
His Cys Leu 



aag act gat 
Lys Thr Asp 



ttc tgc cgt 
Phe Cys Arg 



aac cct caa 
Asn Pro Gin 



aaa gcc tat 
Lys Ala Tyr 



gta ccc cga 
Val Pro Arg 



etc aat ttt 
Leu Asn Phe 



cgt cat tgt 
Arg His Cys 



aaa aat gtc 
Lys Asn Val 



ata aga tec 
He Arg Ser 



aac cca ttt 
Asn Pro Phe 



gac caa ggg 
Asp Gin Gly 



gtc aat aat 
Val Asn Asn 



tec act ccc 
Ser Thr Pro 



gag gca tgg 
Glu Ala Trp 
1450 

aat gga cca 
Asn Gly Pro 
1465 

cag atg gac 
Gin Met Asp 
1480 

gga cag ggt 
Gly Gin Gly 
1495 

ctt ata aaa 
Leu He Lys 
1510 

gca cca aga 
Ala Pro Arg 
1525 

tta aat att 
Leu Asn He 
1540 

tea gag cca 
Ser Glu Pro 
1555 

ctt gat aat 
Leu Asp Asn 
1570 

agg gga get 
Arg Gly Ala 
1585 

tgg gta cca 
Trp Val Pro 
1600 

aat act aat 
Asn Thr Asn 
1615 

aaa gag aga 
Lys Glu Arg 
1630 

aat gac ggt 
Asn Asp Gly 



agt get tgg 
Ser Ala Trp 
1455 

get tat acg 
Ala Tyr Thr 
1470 

gta acc cac 
Val Thr His 
1485 

att gtt gag 
He Val Glu 
1500 

cag aag agg 
Gin Lys Arg 
1515 

gtg teg gtg 
Val Ser Val 
1530 

gat get cat 
Asp Ala His 
1545 

gat agg ccc 
Asp Arg Pro 
1560 

aaa tgg tat 
Lys Trp Tyr 
1575 

ate tgt gtt 
He Cys Val 
1590 

gaa aga etc 
Glu Arg Leu 
1605 

gtc cct cgt 
Val Pro Arg 
1620 

gca gcg ttg 
Ala Ala Leu 
1635 

gat gta 
Asp Val 



i 
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1640 1645 

taatgctcaa gtattctcct gcttttttac cactaactag gaactgggtt; tggccttaat 5914 

tcagacagcc ttggctctgt ctggacaggt ccagatgact gacaccatta acactttgtc 5974 

agcctcagtg actacagtca tagatgaaca ggcctcagct aatgtcaaga tacagagagg 6034 

tctcatgctg gttaatcaac tcatagatct tgtccagaaa caactggatg tattatgaca 6094 

aataactcag cagggatgtg aacaaaagtt tccgggattg tgtgttattt ccattcagta 6154 

tgttaaattt actaggacag ctaatttgtc aaaaagtctt tttcagtata tgttacagaa 6214 

ttggatggct gaatttgaat agatccttcg agaattgaga cttcaggtca actccacgcg 6274 

cttggacctg tcgctgacca aaagattacc caattggatc tcctcagcat tttctttctt 6334 

taaaaaattg ggtgggatta atattatttg gagatacact ttgctgtgga ttagtgttgc 63 94 

ttctttgatt ggtctgtaag cttaaggccc aaactaagag agacaaggtg gttattgccc 64 54 

aggcgcttgc aggactagaa catggagctt cccctgatat atctatgctt aagcaatagg 6514 

tcgctggcca ctcagctctt atatctcacg aggctagtct cattgcacga gatagagtga 6574 

gtgtgcttca gcagcccgag agagttgcaa ggctaagcac tgcagtagaa gggctctgcg 6634 

gcacatatga gcctattcta gggagacatg tcatctttca tgaaggttca gtgtcctagt 6694 

tcccttcccc caggcaaaac gacacgggag caggtcaggg ttgctctggg taaaagcctg 6754 

taagcctaag agctaatcct gtacatggct cctttaccta cacactgggg atttgacctc 6814 

tatctccact ctcattaata tgggtggcct atttgctctt attaaaagaa aaagggggaa 6874 

ctgtgggaag ccgcccccac attcgccgtc acaagatggc gctgacatcc tgtgttctaa 6934 

gttggtaaac aaataatctg cgcatgagcc aagggtattt acgactactt gtactctgtt 6994 

tttcccgtga acgtcagctc ggccatgggc tgcagccaat cagggagtga. tgcgccctag 7054 

gcaatggttg ttctctttaa aatagaaggg gtttcgtttt tctcgctctc ttgcttcttg 7114 

ctctctcttg cttccctctc ttgcttcttg ctctctcttg cttccctctc ttgcttcttg 7174 

ctctctcttg cttccctctc ttgcttcttg ctctctcttg cttcttgctc tcttttcctg 7234 

aagatgtaag aataaagctt tgtcgcagaa gattctggtc tgtggtgttc ttcctggccg 72 94 

gtcgtgagaa cgcgtcgaat aaca 7318 



<210> 2 

<211> 586 

<212> PRT 

<213> Mus musculus 

<220> 

<223> IAP sequence amino acici sequence (gag #1) 

<400> 2 



Met 


Asn 


Ser 


Glu Leu 


Phe 


Ser 


Trp 


Gly 


Thr Arg 


Val 


Pro 


Val Ser Met 


1 






5 










10 






15 


Phe 


Gly 


Leu 


Glu Phe 


Phe 


Leu 


Val 


Leu 


Gly Ala 


Leu 


Leu 


Phe Leu Phe 








20 








25 








30 


Thr 


Cys 


Tyr 


He Val 


Val 


Lys 


Ala 


Gly 


Leu Lys 


He 


Leu 


Asp Glu He 






35 








40 








45 




Gin 


Gly 


Ser 


Leu , Ser 


Glu 


Val 


Lys 


Arg 


Gly Glu 


Arg 


Val 


Gly Ala Arg 




50 








55 








60 






Arg 


Asn 


Gly 


Lys Tyr 


Thr 


Gly 


Leu 


Ser 


Lys Gly 


Leu 


Glu 


Pro Glu Glu 


65 








70 








75 






80 


Lys 


Leu 


Arg 


Leu Gly 


Arg 


Asn 


Thr 


Trp 


Arg Glu 


He 


Arg Arg Lys Arg 








85 










90 






95 


Gly 


Lys 


Arg 


Glu Lys 


Lys 


Lys 


Asp 


Arg 


Leu Ala 


Glu 


Val 


Ser Arg Arg 



100 105 110 
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Tyr Ser Ser Leu j Asp Glu Leu Arg Lys Pro Ala Leu Ser Ser Ser Glu 

115 120 125 

Ala Ser Glu Glu Ser Ser Ser Glu Glu Thr Asp Trp Glu Glu Glu Ala 
130 135 140 

Ala His Tyr Gin Pro Ala Asn Trp Ser Arg Lys Lys Pro Lys Ala Ala 
145 150 155 160 

Gly Glu Ser Gin Arg Thr Val Gin Pro Pro Gly Ser Arg Phe Gin Gly 

165 170 175 

Pro Pro Tyr Ala, Glu Pro Pro Pro Cys Val Val Arg Gin Gin Cys Ala 

180 185 190 

Glu Arg Gin Cys Ala Glu Arg Cys Ala Glu Arg Gin Cys Ala Asp Arg 
195 200 205 

Cys Ala Glu Arg Gin Cys Ala Glu Arg Gin Cys Ala Asp Ser Phe lie 
210 215 220 

Pro Arg Glu Glu Gin Lys Lys lie Glu Gin Ala Phe Pro Val Phe Glu 
225 230 235 240 

Gly Ala Glu Gly Gly Arg Val His Ala Pro Val Glu Tyr Val Gin lie 

245 250 255 

Lys Glu lie Ala Glu Ser Val Arg Lys Tyr Gly Thr Asn Ala Asn Phe 

260 265 270 

Thr Leu Val Gin Leu Asp Arg Leu Ala Gly Met Ala Leu Thr Pro Ala 
275 280 285 

Asp Trp Gin Thr Val Val Lys Ala Ala Leu Pro Ser Met Gly Lys Tyr 
290 295 300 

Met Glu Trp Lys Ala Leu Trp His Glu Ala Ala Gin Ala Gin Ala Arg 
305 310 315 320 

Ala Asn Ala Ala Ala Leu Thr Pro Glu Gin Arg Asp Trp Thr Phe Asp 

325 330 335 

Leu Leu Thr Gly Gin Gly Ala Tyr Ser Ala Asp Gin Thr Asn Tyr His 

340 345 350 

Trp Gly Ala Tyr Ala Gin lie Ser Ser Thr Ala lie Arg Ala Trp Lys 
355 360 365 

Ala Leu Ser Arg Ala Gly Glu Thr Thr Gly Gin Leu Thr Lys lie lie 
370 375 380 

Gin Gly Pro Gin Glu Ser Phe Ser Asp Phe Val Ala Arg Met Thr Glu 
385 390 395 400 

Ala Ala Glu Arg lie Phe Gly Glu Ser Glu Gin Ala Ala Pro Leu lie 
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405 410 415 

Glu Gin Leu lie Tyr Glu Gin Ala Thr Lys Glu Cys Arg Ala Ala lie 

420 425 430 

Ala Pro Arg Lys Asn Lys Gly Leu Gin Asp Trp Leu Arg Val Cys Arg 
435 440 445 

Glu Leu Gly Gly Pro Leu Ser Asn Ala Gly Leu Ala Ala Ala lie Leu 
450 .455 460 

Gin Ser Gin Asn Arg Ser Met Gly Arg Asn Asp Gin Arg Thr Cys Phe 
465 470 475 480 

Asn Cys Gly Lys Pro Gly His Phe Lys Lys Asp Cys Arg Ala Pro Asp 

485 490 ' 495 

Lys Gin Gly Gly Thr Leu Thr Leu Cys Ser Lys Cys Gly Lys Gly Tyr 

500 505 510 

His Arg Ala Asp Gin Cys Arg Ser Val Arg Asp lie Lys Gly Arg lie 
515 520 525 

Leu Pro Pro Pro Asp Ser Gin Ser Ala Asp Val Pro Lys Asn Gly Ser 
530 535 540 

Pro Gly Pro Arg Ser Gin Gly Pro Gin Arg Tyr Gly Asn Arg Phe Val 
545 550 555 560 

Arg Thr Gin Glu i Ala Val Arg Glu Thr Thr Gin Glu Asp Pro Gin Gly 

565 570 575 

Trp Thr Cys Val Pro Pro Pro Thr Ser Tyr 

580 585 



<210> 3 

<211> 258 

<212> PRT 

<213> Mus musculus 

<220> 

<223> IAP sequence amino acid sequence (gag #2) 
<400> 3 

Met Pro Gin Met Ser lie Gin Pro Val Pro Val Glu Pro lie Pro Ser 
15 10 15 

Leu Pro Pro Gly Thr Met Gly Leu lie Leu Gly Arg Gly Ser Leu Thr 

20 25 30 

Leu Gin Gly Leu Val Val His Pro Gly lie Met Asp Cys Gin His Ser 
35 40 45 
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Pro Glu lie Gin; Val Leu Cys Ser Ser Pro Lys Gly Val Phe Ser lie 
50 55 60 

Ser Lys Gly Asp Arg lie Ala Gin Leu Leu Leu Leu Pro Asp Asn Thr 
65 70 75 80 



Arg Glu Lys Ser Ala Gly Pro Glu lie Lys Lys Met Gly Ser Ser Gly 

85 90 95 

Asn Asp Ser Ala ( Tyr Leu Val Val Ser Leu Asn Asp Arg Pro Lys Leu 

100 105 110 

Arg Leu Lys lie Asn Gly Lys Glu Phe Glu Gly lie Leu Asp Thr Gly 

115 120 125 

i 

i 

Ala Asp Lys Ser lie lie Ser Thr His Trp Trp Pro Lys Ala Trp Pro 
130 135 140 

Thr Thr Glu Ser Ser His Ser Leu Gin Gly Leu Gly Tyr Gin Ser Cys 
145 150 155 160 

Pro Thr lie Ser Ser Val Ala Leu Thr Trp Glu Ser Ser Glu Gly Gin 

165 170 175 

Gin Gly Lys Phe lie Pro Tyr Val Leu Pro Leu Pro Val Asn Leu Trp 

180 185 190 

Gly Arg Asp lie Met Gin His Leu Gly Leu lie Leu Ser Asn Glu Asn 
195 200 205 

Ala Pro Ser Gly Gly Tyr Ser Ala Lys Ala Lys Asn lie Met Ala Lys 
210 215 220 

Met Gly Tyr Lys , Glu Gly Lys Gly Leu Gly His Gin Glu Gin Gly Arg 
225 230 235 240 

lie Glu Pro lie Ser Pro Asn Gly Asn Gin Asp Arg Gin Gly Leu Gly 

245 250 255 

Phe Pro 



<210> 4 

<211> 805 

<212> PRT 

<213> Mus musculus 

<220> 

<22 3> IAP sequence amino acid sequence (pol) 

<400> 4 



I 
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Met Asn Leu Phe Gly Pro Val Gin Arg Gly Leu Pro Val Leu Ser Ala 
15 10 15 

Leu Pro Arg Gly Trp Asn Leu lie lie lie Asp lie Lys Asp Cys Phe 

20 . 25 30 

Phe Ser lie Pro Leu Cys Pro Arg Asp Arg Pro Arg Phe Ala Phe Thr 
35 40 45 

lie Pro Ser lie Asn His Met Glu Pro Asp Lys Arg Tyr Gin Trp Lys 
50 55 60 

Val Leu Pro Gin Gly Met Ser Asn Ser Pro Thr Met Cys Gin Leu Tyr 
65 70 75 80 

Val Gin Glu Ala Leu Leu Pro Val Arg Glu Gin Phe Pro Ser Leu lie 

85 90 , 95 

Leu Leu Leu Tyr Met Asp Asp lie Leu Leu Cys His Lys Asp Leu Thr 

100 105 110 

Met Leu Gin Lys Ala Tyr Pro Phe Leu Leu Lys Thr Leu Ser Gin Trp 
115 120 125 

Gly Leu Gin lie Ala Thr Glu Lys Val Gin lie Ser Asp Thr Gly Gin 
130 135 140 

{ 

Phe Leu Gly Ser Val Val Ser Pro Asp Lys lie Val Pro Gin Lys Val 
145 150 155 160 

Glu lie Arg Arg Asp His Leu His Thr Leu Asn Asp Phe Gin Lys Leu 

165 170 175 

Leu Gly Asp lie Asn Trp Leu Arg Pro Phe Leu Lys lie Pro Ser Ala 

180 185 190 



Glu Leu Arg Pro Leu Phe Ser lie Leu Glu Gly Asp Pro His lie Ser 
195 200 205 

Ser Pro Arg Thr Leu Thr Leu Ala Ala Asn Gin Ala Leu Gin Lys Val 
210 215 220 

Glu Lys Ala Leu Gin Asn Ala Gin Leu Gin Arg lie Glu Asp Ser Gin 
225 230 235 240 

Pro Phe Ser Leu Cys Val Phe Lys Thr Ala Gin Leu Pro Thr Ala Val 

245 • 250 255 

Leu Trp Gin Asn: Gly Pro Leu Leu Trp lie His Pro Asn Val Ser Pro 

260 265 270 

Ala Lys lie lie Asp Trp Tyr Pro Asp Ala lie Ala Gin Leu Ala Leu 
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275 



280 



285 



Lys Gly Leu Lys Ala Ala lie Thr His Phe Gly Gin Ser Pro Tyr Leu 
290 295 300 

Leu lie Val Pro Tyr Thr Ala Ala Gin Val Gin Thr Leu Ala Ala Ala 
305 310 315 320 

Ser Asn Asp Trp Ala Val Leu Val Thr Ser Phe Ser Gly Lys lie Asp 

325 330 335 

Asn His Tyr Pro Lys His Pro lie Leu Gin Phe Ala Gin Asn Gin Ser 

340 345 350 

Val Val Phe Pro Gin lie Thr Val Arg Asn Pro Leu Lys Asn Gly lie 
355 360 365 

Val Val Tyr Thr Asp Gly Ser Lys Thr Gly lie Gly Ala Tyr Val Ala 
370 375 380 

Asn Gly Lys Val Val Ser Lys Gin Tyr Asn Glu Asn Ser Pro Gin Val 
385 390 395 400 

Val Glu Cys Leu Val Val Leu Glu Val Leu Lys Thr Phe Leu Lys Pro 

405 410 415 

Leu Asn lie Val Ser Asp Ser Cys Tyr Val Val Asn Ala Val Asn Leu 

420 425 430 

* 

Leu Glu Val Ala Gly Val lie Lys Pro Ser Ser Arg Val Ala Asn lie 
435 440 445 

Phe Gin Gin lie Gin Leu Val Leu Leu Ser Arg Arg Ser Pro Val Tyr 
450 455 460 

lie Thr His Val Arg Ala His Ser Gly Leu Pro Gly Pro Met Ala Leu 
465 470 475 480 

Gly Asn Asp Leu Ala Asp Lys Ala Thr Lys Val Val Ala Ala Ala Leu 

485 490 495 

Ser Ser Pro Val Glu Ala Ala Arg Asn Phe His Asn Asn Phe His Val 

500 : 505 510 

Thr Ala Glu Thr Leu Arg Ser Arg Phe Ser Leu Thr Arg Lys Glu Ala 
515 520 525 

Arg Asp lie Val Thr Gin Cys Gin Ser Cys Cys Glu Phe Leu Pro Val 
530 535 540 

Pro His Val Gly lie Asn Pro Arg Gly lie Arg Pro Leu Gin Val Trp 
545 550 555 560 



Gin Met Asp Val Thr His Val Ser Ser Phe Gly Lys Leu Gin Tyr Leu 
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565 570 575 

His Val Ser lie Asp Thr Cys Ser Gly lie Met Phe Ala Ser Pro Leu 

580 585 590 

Thr Gly Glu Lys Ala Ser His Val lie Gin His Cys Leu Glu Ala Trp 
595 600 605 

Ser Ala Trp Gly Lys Pro Arg Leu Leu Lys Thr Asp Asn Gly Pro Ala 
610 615 620 

Tyr Thr Ser Gin Lys Phe Gin Gin Phe Cys Arg Gin Met Asp Val Thr 
625 630 635 640 

His Leu Thr Gly Leu Pro Tyr Asn Pro Gin Gly Gin Gly lie Val Glu 

645 650 655 

Arg Ala His Arg Thr Leu Lys Ala Tyr Leu lie Lys Gin Lys Arg Gly 

660 665 670 

Thr Phe Glu Glu Thr Val Pro Arg Ala Pro Arg Val Ser Val Ser Leu 
675 680 685 

Ala Leu Phe Thr Leu Asn Phe Leu Asn lie Asp Ala His Gly His Thr 
690 695 700 

Ala Ala Glu Arg His Cys Ser Glu Pro Asp Arg Pro Asn Glu Met Val 
705 710 715 720 

Lys Trp Lys Asn Val Leu Asp Asn Lys Trp Tyr Gly Pro Asp Pro lie 

725 730 *' 735 

Leu lie Arg Ser Arg Gly Ala lie Cys Val Phe Pro Gin Asn Glu Asp 

740 745 750 

Asn Pro Phe Trp Val Pro Glu Arg Leu Thr Arg Lys lie Gin Thr Asp 
755 760 765 

Gin Gly Asn Thr Asn Val Pro Arg Leu Gly Asp Val Gin Gly Val Asn 
770 775 780 

Asn Lys Glu Arg Ala Ala Leu Gly Asp Asn Val Asp lie Ser Thr Pro 
785 790 795 800 

Asn Asp Gly Asp Val 

805 



<210> 5 

<211> 673 

<212> DNA 

<213> Mus musculus 
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<223> CMV promoter sequence 
<400> 5 

tggccattgc atacgttgta tccatatcat aatatgtaca tttatattgg ctcatgtcca 60 

acattaccgc catgttgaca ttgattattg actagttatt aatagtaatc aattacgggg 12 0 

tcattagttc atagcccata tatggagttc cgcattacat aacttacggt aaatggcccg 180 

cctggctgac cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata 240 

gtaacgccaa tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc 300 

cacttggcag tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac 360 

ggtaaatggc ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg 420 

cagtacatct acgtattagt catcgctatt accatggtga tgcggttttg gcagtacatc 480 

aatgggcgtg gatagcggtt tgactcacgg ggatttccaa gtctccaccc cattgacgtc 540 

aatgggagtt tgttttggca ccaaaatcaa cgggactttc caaaatgtcg taacaactcc 600 

gccccattga cgcaaatggg cggtaggcgt gtacggtggg aggtctatat aagcagagct 660 

cgtttagtga acc 673 



<210> 6 

<211> 655 

<212> DNA 

<213> Mus musculus 

<220> 

<223> CA1 promoter sequence (without the R region and with two bases 
deletion in the promoter region in addition thereto) 

<400> 6 

attgattatt gactagttat taatagtaat caattacggg gtcattagtt catagcccat 60 

atatggagtt ccgcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg 120 

acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt 180 

tccattgacg tcaatgggtg gactatttac ggtaaactgc ccacttggca gtacatcaag 24 0 

tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc 3 00 

attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag 360 

tcatcgctat taccatgggt cgaggtgagc cccacgttct gcttcactct ccccatctcc 420 

cccccctccc cacccccaat tttgtattta tttatttttt aattattttg tgcagcgatg 480 

ggggcggggg gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg 54 0 

ggcgaggcgg agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa' agtttccttt 600 

tatggcgagg cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcg 655 



<210> 7 

<211> 657 

<212> DNA 

<213> Mus musculus 

<220> 

<22 3> CA2 promoter sequence (without the R region) 

<400> 7 

attgattatt gactagttat taatagtaat caattacggg gtcattagtt catagcccat 60 

atatggagtt ccgcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg 120 

acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt 180 

tccattgacg tcaatigggtg gactatttac ggtaaactgc ccacttggca gtacatcaag 24 0 

tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc 300 
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attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag 360 

tcatcgctat tacciatgggt cgaggtgagc cccacgttct gcttcactct ccccatctcc 420 

cccccctccc cacccccaat tttgtattta tttatttttt aattattttg tgcagcgatg 480 

ggggcggggg gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg 54 0 

ggcgaggcgg agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa' agtttccttt 600 

tatggcgagg cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcggg 657 



<210> 8 
<211> 278 
<212> DNA 

<213> Gallus gallus 
<220> 

<223> avian beta-actin promoter sequence 
<400> 8 

tcgaggtgag ccccacgttc tgcttcactc tccccatctc ccccccctcc ccacccccaa 60 
ttttgtattt atttattttt taattatttt gtgcagcgat gggggcgggg gggggggggg 120 
cgcgcgccag gcggggcggg gcggggcgag^ gggcggggcg gggcgaggcg gagaggtgcg 180 
gcggcagcca atcagagcgg cgcgctccga aagtttcctt ttatggcgag gcggcggcgg 240 
cggcggccct ataaaaagcg aagcgcgcgg cgggcggg 278 



<210> 9 

<211> 41 

<212> DNA 

<213> Mus musculus 

<220> 

<223> forward primer sequence for isolation of the IAP element used 
in Example 1 

<400> 9 

gcagcggccg ccgtggtggc acacactttt agtccccgca g 41 



<210> 10 

<211> 41 

<212> DNA 

<213> Mus musculus 

<220> 

<223> reverse primer sequence for isolation of the IAP element used 
in Example 1 

<400> 10 

ggcgcactag tgatgccctc tcaggcctcc actcaggcac t 41 



<210> 11 

<211> 30 

<212> DNA 

<213> Mus musculus 



V 
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<220> 

<223> forward primer sequence for isolation of the full length of the IAP 
element used in Example 1 



<400> 11 

atgcccagat ttcttccacg gctattaggg 



30 



<210> 12 

<211> 30 

<212> DNA 

<213> Mus musculus 



<220> 
<223> 



reverse primer sequence for isolation of the full length of the IAP 
element used in Example 1 



<400> 12 

gatgccctct caggcctcca ctcaggcact 



30 



<210> 13 

<211> 40 

<212> DNA 

<213> Mus 



musculus 



<220> 

<223> forward primer sequence related to the CMV promoter used 
in Example 1 (c) 

<400> 13 

ccaagcggcc gctggccatt gcatacgttg tatccatatc 40 

i 

t 

<210> 14 

<211> 40 

<212> DNA 

<213> Mus musculus 

<220> 

<223> reverse primer sequence related to the CMV promoter used 
in Example 1 (c) 

<400> 14 

gcgagaaaaa cggttcacta aacgagctct gcttatatag 4 0 



<210> 15 

<211> 30 

<212> DNA 

<213> Mus 

<220> 



musculus 
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<223> forward primer sequence related to the R region of the IAP used 
in Example 1 (c) 

<400> 15 

ttagtgaacc gtttttctcg ctctcttgct 30 



<210> 16 

<211> 30 

<212> DNA 

<213> Mus musculus 

<220> 

<22 3> reverse primer sequence related to the R region of the IAP used 
in Example 1 (c) 

<400> 16 

tctgaaatga agtatccctc ctgcgccagt 30 



<210> 17 

<211> 63 ' 

<212> DNA 

<213> Mus musculus 

<220> 

<223> a linking sequence of a linker DNA used in Example 3 

<400> 17 

cgaatcgtaa ccgttcgtac gagaattcgt acgagaatcg ctgtcctctc caacgagcca 60 
agg 63 



<210> 18 

<211> 26 

<212> DNA 

<213> Mus musculus 

<220> 

<223> a linking sequence of a linker DNA used in Example 3 
<400> 18 

ccttggctcg tttttttttg caaaaa 26 

<210> 19 

<211> 25 

<212> DNA 

<213> Mus musculus 

i 

<220> 

<223> a linker specific primer for use in the first round 
in Example 3 (forward) 
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<400> 19 

cgaatcgtaa ccgttcgtac gagaa 25 



<210> 20 

<211> 30 

<212> DNA 

<213> Mus musculus 

<220> 

<223> a linker specific primer for use in the first round 
in Example 3 (reverse) 

<400> 20 

gagatgcatg ctttgcatac ttctgcctgc 30 



<210> 21 

<211> 25 

<212> DNA 

<213> Mus musculus 

<220> 

<223> a linker specific primer for use in the second round 
in Example 3 (forward) 

<400> 21 

tcgtacgaga atcgctgtcc tctcc 25 



<210> 22 

<211> 30 

<212> DNA 

<213> Mus musculus 

<220> 

<223> a linking sequence of neo cassette specific primer for use 
in the second round in Example 3 (reverse) 

<400> 22 

ggagcctggg gactttccac acctggttgc 30 



<210> 23 

<211> 30 

<212> DNA 

<213> Mus musculus 

<220> 

<223> an alternative linking sequence of neo cassette specific primer 
for use in the second round in Example 3 (reverse) 

<400> 23 
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ggggagcctg gggactttcc acaccctaac 30 



<210> 24 

<211> 39 

<212> DNA 

<213> Gallus gallus 

<220> 

<223> a primer 5' upstream until the transcription initiation site of 
chicken beta-actin promoter used in Example 4 

<400> 24 

gcaatgcggc cgcattgatt attgactagt tattaatag 3 9 



<210> 25 

<211> 39 

<212> DNA 

<213> Gallus gallus 

<220> 

<223> a primer 3' of chicken beta-actin promoter used in Example 4 

<400> 25 

cgagaaaaac cgcccgccgc gcgcttcgct ttttatagg 3 9 



<210> 26 

<211> 40 

<212> DNA 

<213> Gallus gallus 

<220> 

<223> an alternative primer 3' of chicken beta-actin promoter used 
in Example 4 

<400> 26 

cgagaaaaac cccgcccgcc gcgcgcttcg ctttttatag 40 



<210> 27 

<211> 36 

<212> DNA 

<213> Mus musculus 



<220> 

<223> a primer of the 5' upstream from the 5' terminus of the R region o 
the IAP to the downstream of the U5 region used in Example 4 

<400> 27 

cgcggcgggc ggtttttctc gctctcttgc ttcttg 36 
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<210> 28 

<211> 30 

<212> DNA 

<213> Mus musculus 

<220> 

<223> a primer of the 3 f side from the 5' terminus of the R region of the 
IAP 

to the downstream of the U5 region used in Example 4 
<400> 28 

tctgaaatga agtatccctc ctgcgccagt 30 



<210> 29 
<211> 36 
<212> DNA 

l 

<213> Mus musculus 

■ 

<220> 

<223> an alternative primer of the 3' side from the 5' terminus of 

the R region of the IAP to the downstream of the U5 region used 
in Example 4 

<400> 29 

cggcgggcgg ggtttttctc gctctcttgc ttcttg 36 



<210> 30 

<211> 903 

<212> DNA 

<213> Mus musculus 

<220> 

<223> gamma globin intron sequence 
<400> 30 

gtgagtccag gagatgtttc agcactgttg cctttagtct cgaggcaact tagacaactg 60 

agtattgatc tgagcacagc agggtgtgag ctgtttgaag atactggggt tgggggtgaa 12 0 

gaaactgcag aggactaact gggctgagac ccagtggcaa tgttttaggg cctaaggaat 180 

gcctctgaaa atctagatgg acaactttga ctttgagaaa agagaggtgg aaatgaggaa 24 0 

aatgactttt ctttattaga tttcggtaga aagaactttc acctttcccc tatttttgtt 300 

attcgtttta aaacatctat ctggaggcag gacaagtatg gtcgttaaaa agatgcaggc 360 

agaaggcata tattggctca gtcaaagtgg gggaactttg gtggccaaac, atacattgct 420 

aaggctattc ctatatcagc tggacacata taaaatgctg ctaatgcttc attacaaact 4 80 

tatatccttt aattccagat gggggcaaag tatgtccagg ggtgaggaac aattgaaaca 54 0 

tttgggctgg agtagatttt gaaagtcagc tctgtgtgtg tgtgtgtgtg tgtgtgtgtg 600 

tgtgtgtgcg cgcacgtgtg tttgtgtgtg tgtgagagcg tgtgtttctt ttaacgtttt 660 

cagcctacag catacagggt tcatggtggc aagaagataa caagatttaa attatggcca 720 

gtgactagtg ctgcaagaag aacaactacc tgcatttaat gggaaagcaa aatctcaggc 780 

tttgagggaa gttaacatag gcttgattct gggtggaagc tgggtgtgta gttatctgga 840 

ggccaggctg gagctctcag ctcactatgg gttcatcttt attgtctcct ttcatctcaa 900 

cag 903 



24 



<210> 31 

<211> 15 

<212> DNA 

<213> Mus musculus 

<220> 

<223> a sequence of the tRNA binding site of the full length IAP 

<400> 31 

tccgggacga gaaaa 15 

<210> 32 

<211> 15 

<212> DNA 

<213> Mus musculus 

<220> 

<223> a repeat sequence of the R region of the full length IAP 



<210> 33 

<211> 17 

<212> DNA 

<213> Mus musculus 

<220> 

<223> a specific sequence for the full length IAP (tRNA binding site) 

<400> 33 



<400> 32 
ttgcttcttg ctctc 



15 



tggtgccgaa ttccggg 



17 



> 



<210> 
<211> 
<212> 
<213> 



34 
DNA 

Mus musculus 



<220> 
<223> 



a tandem repeat sequence specific for the full length IAP 



<400> 34 
aatccgggac gagaa 



15 



<210> 
<211> 
<212> 
<213> 



35 
11 
DNA 



Mus musculus 
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<220> 

<223> a repeat sequence of the R region found in the full length IAP 

<400> 35 
ttgcttcttg c 



11 



<210> 36 

<211> 378 

<212> DNA 

<213> Mus musculus 

<220> 

<-22 3> cytomegalovirus (CMV) enhancer sequence 
<400> 36 

attgattatt gactagttat taatagtaat caattacggg gtcattagtt catagcccat 60 

atatggagtt ccgcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg 12 0 

acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt 180 

tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca gtacatcaag 240 

tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc 300 

attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag 360 

tcatcgctat taccatgg 378 



37 
30 
DNA 

Artificial 



synthetic sequence in the sense direction of 1st primer used 
in Example 8 

<400> 37 

agggctgcgg caagggcaac atcctgttcg 30 

t 

<210> 38 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic sequence in the antisense direction of 1st primer used 
in Example 8 

<400> 38 

gccgccgtcc tccacgtagg tcttctccag 30 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



<210> 39 
<211> 30 
<212> DNA 



i 
I 



I 
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<213> Artificial 
<220> 

<223> synthetic sequence in the sense direction of 2nd primer used 
in Example 8 

<400> 39 

ggcaaccagc tggtgcagat ccgcgtgacc 3 0 



<210> 40 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic sequence in the antisense direction of 2nd primer used 
in Example 8 

<400> 40 

gtccttcacc acgcccttgc tcttcatcag , 30 

<210> 41 
<211> 37 
<212> DNA 
<213> unknown 

<220> 

<22 3> the sequence of a junctional portion between the 
CMV promoter and the R region 

<400> 41 

gagctcgttt agtgaaccgt ttttctcgct ctcttgc 37 

<210> 42 
<211> 36 
<212> DNA 
<213> UNKNOWN 

i 

<220> 

<223> the sequence of a juncture site of CA promoter 



<400> 42 

gaagcgcgcg gcgggcggtt tttctcgctc tcttgc 36 

<210> 43 
<211> 38 
<212> DNA 
<213> unknown 

<220> 

<223> the sequence of a juncture site of CA promoter 
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<400> 43 

gaagcgcgcg gcgggcgggg tttttctcgc tctcttgc 38 

<210> 44 
<211> 60 
<212> DNA 
<213> UNKNOWN 

<220> 

<223> Sequence of the GAG gene preferable for 
transposition 

<400> 44 

atgaattcag aacttttcac gtggggaacg agagtaccag tgagtatgtt tggccttgaa 60 



<210> 45 
<211> 20 
<212> PRT 
<213> UNKNOWN 

<220> 

<223> Sequence of the GAG gene preferable for 
transposition 

<400> 45 

Met Asn Ser Glu Leu Phe Ser Trp Gly Thr Arg Val Pro Val Ser Met 

1 *5 10 15 

Phe Gly v Leu Glu 

20 

<210> 46 
<211> 15 
<212> DNA 
<213> unknown 

<220> 

<22 3> repeat sequence 
<400> 46 

tccgggacga gaaaa 15 

"> 

<210> 47 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> repeat sequence 
<400> 47 

ttgcttcttg ctctc 15 



